# Lightweight Deployment

Midm 2.0 Base Instruct Gguf
MIT
Mi:dm 2.0 is a Korea-centric AI model developed with KT's proprietary technology. It is designed to internalize the values, cognitive frameworks, and common-sense reasoning distinctive to South Korean society.
Large Language Model Transformers Supports Multiple Languages
mykor
517
1
Qari OCR 0.3 SNAPSHOT VL 2B Instruct Merged GGUF
A statically quantized version of the Qari-OCR-0.3-SNAPSHOT-VL-2B-Instruct-merged model, used mainly for image-to-text conversion tasks.
Image-to-Text Transformers English
mradermacher
188
0
Devstral Small 2505 GGUF
Apache-2.0
An efficient, lightweight language model designed for software engineering projects, with a 128k-token context window suited to complex coding tasks.
Large Language Model Supports Multiple Languages
Mungert
1,409
1
Nvidia.cosmos Reason1 7B GGUF
Cosmos-Reason1-7B is a 7B-parameter foundational model released by NVIDIA, specializing in image-to-text tasks.
Large Language Model
DevQuasar
287
1
Devstral Small 2505 GGUF
Apache-2.0
Quantized version of Devstral-Small-2505, offering multiple precision options to adapt to different hardware requirements
Large Language Model Supports Multiple Languages
Antigma
170
1
Unsloth.devstral Small 2505 GGUF
Devstral-Small-2505 is a small language model based on the Mistral architecture. It supports text generation and gains basic vision capabilities when paired with a compatible mmproj file.
Text-to-Image
DevQuasar
949
1
Devstral Small 2505 Fp8
Apache-2.0
Devstral is an agentic large language model for software engineering tasks, developed by Mistral AI in collaboration with All Hands AI. It excels at using tools to explore codebases, editing multiple files, and driving software engineering agents.
Large Language Model Safetensors Supports Multiple Languages
bullerwins
243
1
Devstral Small 2505 GGUF
Apache-2.0
Devstral is an intelligent LLM specifically designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels in code exploration, multi-file editing, and driving software engineering agents.
Large Language Model Supports Multiple Languages
unsloth
72.26k
64
Devstral Small 2505
Apache-2.0
Devstral is an intelligent large language model specifically designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels in code exploration, multi-file editing, and driving software engineering agents.
Large Language Model Safetensors Supports Multiple Languages
unsloth
317
11
Devstral Small 2505 Unsloth Bnb 4bit
Apache-2.0
Devstral is an agentic large language model for software engineering tasks, developed by Mistral AI in collaboration with All Hands AI. It excels at using tools to explore codebases, editing multiple files, and driving software engineering agents.
Large Language Model Safetensors Supports Multiple Languages
unsloth
873
3
Devstral Small 2505 Bnb 4bit
Apache-2.0
Devstral is an intelligent large language model specifically designed for software engineering tasks, developed in collaboration by Mistral AI and All Hands AI. It excels in codebase exploration, multi-file editing, and driving software engineering agents.
Large Language Model Safetensors Supports Multiple Languages
unsloth
465
3
Devstral Small 2505 Gguf
Apache-2.0
Devstral is an intelligent large language model specifically designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels in code exploration, editing, and driving software engineering agents.
Large Language Model Supports Multiple Languages
mistralai
8,964
44
Sam Reason S2.1 GGUF
MIT
Static quantized version of Sam-reason-S2.1, offering multiple quantization options to suit different hardware requirements
Large Language Model English
mradermacher
299
1
Qwen2 VL OCR 2B Instruct GGUF
Apache-2.0
A multimodal model fine-tuned based on Qwen/Qwen2-VL-2B-Instruct, optimized for OCR, image-to-text conversion, LaTeX math solving, and handwriting recognition
Image-to-Text Supports Multiple Languages
prithivMLmods
142
1
Llava 1.5 7b Hf Q4 K M GGUF
This model is a GGUF format conversion of llava-hf/llava-1.5-7b-hf, supporting image-to-text generation tasks.
Image-to-Text English
Marwan02
30
1
Ten Vad
Apache-2.0
TEN VAD is a low-latency, lightweight, and high-performance streaming voice activity detection system, suitable for real-time voice processing scenarios.
Speech Recognition Other
TEN-framework
16
29
Devstral Small 2505
Apache-2.0
Devstral is an intelligent large language model developed by Mistral AI in collaboration with All Hands AI for software engineering tasks, excelling in codebase exploration, multi-file editing, and driving software engineering agents.
Large Language Model Safetensors Supports Multiple Languages
mistralai
102.17k
601
INTELLECT 2 GGUF
INTELLECT-2-GGUF is the GGUF format quantized version of PrimeIntellect/INTELLECT-2, suitable for text generation tasks.
Large Language Model
MaziyarPanahi
88
1
Ace Gguf
Apache-2.0
ACE-Step-v1-3.5B is a text-to-audio model that supports high-quality audio generation, suitable for music and sound effects creation.
Audio Generation
calcuis
1,332
12
Qwen2.5 7b SFT Three Subtasks 3epoch
A model built with the 🤗 Transformers library; its specific function and intended use have not yet been documented.
Large Language Model Transformers
mjfmark
97
1
Openvision Vit Huge Patch14 84
Apache-2.0
OpenVision is a fully open, cost-effective family of advanced vision encoders designed for multimodal learning.
Image Classification Transformers
UCSC-VLAA
19
0
Openvision Vit Base Patch8 224
Apache-2.0
OpenVision is a fully open, cost-effective family of advanced visual encoders focused on multimodal learning.
Image Classification
UCSC-VLAA
43
0
Openvision Vit Tiny Patch8 384
Apache-2.0
OpenVision is a fully open, cost-effective advanced visual encoder family focused on multimodal learning.
Image Enhancement Transformers
UCSC-VLAA
16
0
Parakeet Tdt 0.6b V2 Mlx
An automatic speech recognition model converted to MLX format for fast inference.
Speech Recognition Safetensors English
senstella
183
6
Allenai.olmo 2 0425 1B Instruct GGUF
OLMo-2-0425-1B-Instruct is a 1-billion-parameter instruction-finetuned language model developed by AllenAI, focused on text generation tasks.
Large Language Model
DevQuasar
220
1
Mlabonne Qwen3 4B Abliterated GGUF
A llama.cpp quantization of Qwen3-4B-abliterated, available in multiple quantization types and suitable for text generation tasks.
Large Language Model
bartowski
3,623
3
Josiefied Qwen3 4B Abliterated V1 Gguf
Apache-2.0
This is the GGUF quantized version of the Josiefied-Qwen3-4B-abliterated-v1 model, suitable for local deployment and execution.
Large Language Model
Goekdeniz-Guelmez
4,518
7
Quantized Dia 1.6B Int8
Apache-2.0
Dia is a 1.6 billion parameter open-source text-to-speech model that supports highly realistic dialogue and non-verbal expression generation
Speech Synthesis Supports Multiple Languages
RobAgrees
69
0
Jungzoona T3Q Qwen2.5 14b V1.0 E3 GGUF
Apache-2.0
This repository contains GGUF-format model files for JungZoona/T3Q-qwen2.5-14b-v1.0-e3, quantized on TensorBlock's machines and compatible with llama.cpp.
Large Language Model Transformers Supports Multiple Languages
tensorblock
557
1
Dia 1.6B
Apache-2.0
Dia is an open-weight text-to-dialogue model that supports dialogue text generation and speech synthesis.
Speech Synthesis English
mlx-community
370
12
Huihui Ai.glm 4 9B 0414 Abliterated GGUF
GLM-4-9B-0414-abliterated is a large language model with 9B parameters based on the GLM architecture, suitable for text generation tasks.
Large Language Model
DevQuasar
3,172
3
Google Gemma 3 4b It Qat GGUF
A quantized version of Google's Gemma 3 4B model based on QAT (quantization-aware training) weights, offered at multiple quantization levels for efficient inference in resource-constrained environments.
Large Language Model
bartowski
4,538
4
Llama 3.2 11B Vision Radiology Mini
This is a multimodal model based on the Llama architecture, supporting vision and text instructions, optimized with 4-bit quantization.
Image-to-Text
p4rzvl
69
0
Llama381binstruct Summarize Short Merged
Other
A merged model based on Meta-Llama-3.1-8B-Instruct, fine-tuned for legal summarization tasks, capable of converting legal terminology into concise and understandable summaries.
Large Language Model
FlamingNeuron
42
0
Granite 3.3 8b Instruct Q8 0 GGUF
Apache-2.0
This model is a GGUF format model converted from the IBM Granite-3.3-8B instruction fine-tuned model, suitable for text generation tasks.
Large Language Model
NikolayKozloff
36
2
Gemma 3 12b It Qat 8bit
Other
An 8-bit quantized version converted from the Google Gemma 3 12B model, suitable for image-text to text tasks.
Image-to-Text Transformers Other
mlx-community
149
1
Gemma 3 12b It Qat 3bit
Other
This is an MLX-format model converted from the Google Gemma 3-12B model, supporting image-text-to-text tasks.
Image-to-Text Transformers Other
mlx-community
65
1
Salesforce.llama Xlam 2 8b Fc R GGUF
A quantized version of Salesforce's 8B-parameter Llama-xLAM-2 model, specialized in text generation tasks.
Large Language Model
DevQuasar
286
1
Gemma 3 1b It Qat Q4 0 Unquantized
Gemma 3 is a lightweight open-source multimodal model series developed by Google, built on Gemini technology, supporting text and image inputs with text outputs. The 1B version has undergone instruction tuning and quantization-aware training (QAT), making it suitable for deployment in resource-constrained environments.
Image-to-Text Transformers
google
246
4
GLM Z1 9B 0414
MIT
GLM-Z1-9B-0414 is an open-source model in the GLM family with strong mathematical reasoning and general capabilities, suitable for lightweight deployment in resource-constrained scenarios.
Large Language Model Transformers Supports Multiple Languages
THUDM
3,456
55